Learning To Cooperate in a Social Dilemma: A Satisficing Approach to Bargaining
نویسندگان
چکیده
Learning in many multi-agent settings is inherently repeated play. This calls into question the naive application of single play Nash equilibria in multi-agent learning and suggests, instead, the application of give-andtake principles of bargaining. We modify and analyze a satisficing algorithm based on (Karandikar et al., 1998) that is compatible with the bargaining perspective. This algorithm is a form of relaxation search that converges to a satisficing equilibrium without knowledge of game payoffs or other agents’ actions. We then develop an M action, N player social dilemma that encodes the key elements of the Prisoner’s Dilemma. This game is instructive because it characterizes social dilemmas with more than two agents and more than two choices. We show how several different multi-agent learning algorithms behave in this social dilemma, and demonstrate that the satisficing algorithm converges, with high probability, to a Pareto efficient solution in self play and to the single play Nash equilibrium against selfish agents. Finally, we present theoretical results that characterize the behavior of the algorithm.
منابع مشابه
Modeling Cooperation between Nodes in Wireless Networks by APD Game
Cooperation is the foundation of many protocols in wireless networks. Without cooperation, the performance of a network significantly decreases. Hence, all nodes in traditional networks are required to cooperate with each other. In this paper, instead of traditional networks, a network of rational and autonomous nodes is considered, which means that each node itself can decide whe...
متن کاملModeling Cooperation between Nodes in Wireless Networks by APD Game
Cooperation is the foundation of many protocols in wireless networks. Without cooperation, the performance of a network significantly decreases. Hence, all nodes in traditional networks are required to cooperate with each other. In this paper, instead of traditional networks, a network of rational and autonomous nodes is considered, which means that each node itself can decide whe...
متن کاملLearning ε-Pareto Efficient Solutions With Minimal Knowledge Requirements Using Satisficing
Many problems in multiagent learning involve repeated play. As such, naive application of Nash equilibrium concepts are often inappropriate. A recent algorithm in the literature (Stimpson & Goodrich 2003) uses a Nash bargaining perspective instead of a Nash equilibrium perspective, and learns to cooperate in self play in a social dilemma without exposing itself to being exploited by selfish age...
متن کاملDilemmas and bargains: Autism, theory-of-mind, cooperation and fairness
Mentalising is assumed to be involved in decision-making that is necessary to social interaction. We investigated the relationship between mentalising and two types of strategic games those involving the choice to cooperate with another for joint gain or compete for own gain and those involving bargaining and division of a surplus in children and adults with and without autistic spectrum disord...
متن کاملEquilibrium computation of the Hart and Mas-Colell bargaining model
The 8-th problem raised by [Hart, S., Mas-Colell, A., 2010. Bargaining and cooperation in strategic form games. Journal of the European Economics Association 8 (1), 7–33], is solved. To be specific, I show that the set of SP equilibria can be determined by a finite number of systems of linear inequalities, which are efficiently solvable when there are two players. This is more or less surprisin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003